A Survey on Fast Distributed Algorithm on Data Mining
ثبت نشده
چکیده
Data mining is a process of discovering and extracting various models, patterns, summaries, and derived values from a given collection of data. Furthermore, it involves the use of sophisticated data analysis tools to discover previously unknown, valid patterns and relationships in large data sets. It is ordinarily practiced in a wide range of profiling practices, such as marketing, surveillance, fraud detection and scientific breakthrough. Finally, it is the process of placing a series of appropriate queries to extract information from large amounts of data in the database
منابع مشابه
Calculation of One-dimensional Forward Modelling of Helicopter-borne Electromagnetic Data and a Sensitivity Matrix Using Fast Hankel Transforms
The helicopter-borne electromagnetic (HEM) frequency-domain exploration method is an airborne electromagnetic (AEM) technique that is widely used for vast and rough areas for resistivity imaging. The vast amount of digitized data flowing from the HEM method requires an efficient and accurate inversion algorithm. Generally, the inverse modelling of HEM data in the first step requires a precise a...
متن کاملEmpowering Scientific Discovery by Distributed Data Mining on the Grid Infrastructure
The grid-based computing paradigm has attracted much attention in recent years. Computational Grids focus on methods for handling compute intensive tasks while Data Grids are geared towards dataintensive computing. This dissertation considers research in grid-based distributed data mining. While architectures for mining on the grid have already been proposed, the inherently distributed, heterog...
متن کاملA Technique for Improving Web Mining using Enhanced Genetic Algorithm
World Wide Web is growing at a very fast pace and makes a lot of information available to the public. Search engines used conventional methods to retrieve information on the Web; however, the search results of these engines are still able to be refined and their accuracy is not high enough. One of the methods for web mining is evolutionary algorithms which search according to the user interests...
متن کاملTowards Parallel and Distributed Computing in Large-Scale Data Mining: A Survey
The implementation of data mining ideas in high-performance parallel and distributed computing environments is becoming crucial for ensuring system scalability and interactivity as data continues to grow inexorably in size and complexity. This paper is a survey on the parallelization of well-known data mining techniques covering classification, link analysis, clustering and sequential learning,...
متن کاملDistributed Incremental Least Mean-Square for Parameter Estimation using Heterogeneous Adaptive Networks in Unreliable Measurements
Adaptive networks include a set of nodes with adaptation and learning abilities for modeling various types of self-organized and complex activities encountered in the real world. This paper presents the effect of heterogeneously distributed incremental LMS algorithm with ideal links on the quality of unknown parameter estimation. In heterogeneous adaptive networks, a fraction of the nodes, defi...
متن کامل